8 research outputs found

    Decision-Making in Autonomous Driving using Reinforcement Learning

    The main topic of this thesis is tactical decision-making for autonomous driving. An autonomous vehicle must be able to handle a diverse set of environments and traffic situations, which makes it hard to manually specify a suitable behavior for every possible scenario. Therefore, learning-based strategies are considered in this thesis, which introduces different approaches based on reinforcement learning (RL). A general decision-making agent, derived from the Deep Q-Network (DQN) algorithm, is proposed. With few modifications, this method can be applied to different driving environments, which is demonstrated for various simulated highway and intersection scenarios. A more sample-efficient agent can be obtained by incorporating more domain knowledge, which is explored by combining planning and learning in the form of Monte Carlo tree search and RL. In different highway scenarios, the combined method outperforms using either a planning or a learning-based strategy separately, while requiring an order of magnitude fewer training samples than the DQN method. A drawback of many learning-based approaches is that they create black-box solutions, which do not indicate the confidence of the agent's decisions. Therefore, the Ensemble Quantile Networks (EQN) method is introduced, which combines distributional RL with an ensemble approach, to provide an estimate of both the aleatoric and the epistemic uncertainty of each decision. The results show that the EQN method can balance risk and time efficiency in different occluded intersection scenarios, while also identifying situations that the agent has not been trained for. Thereby, the agent can avoid making unfounded, potentially dangerous, decisions outside of the training distribution. Finally, this thesis introduces a neural network architecture that is invariant to permutations of the order in which surrounding vehicles are listed. This architecture improves the sample efficiency of the agent by the factorial of the number of surrounding vehicles.
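    The permutation-invariant idea mentioned at the end of the abstract can be illustrated with a minimal sketch: encode each surrounding vehicle with the same shared weights, then pool with a symmetric operation (a sum), so the output cannot depend on the listing order. All sizes, the linear/tanh encoder, and the single output head below are illustrative assumptions, not details taken from the thesis.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: 4 surrounding vehicles, 3 features each
# (e.g. relative position, relative speed, lane offset).
N_VEHICLES, N_FEATURES, HIDDEN = 4, 3, 8

# Shared encoder weights, applied identically to every surrounding vehicle.
W_enc = rng.normal(size=(N_FEATURES, HIDDEN))
W_out = rng.normal(size=(HIDDEN,))

def q_value(vehicles: np.ndarray) -> float:
    """Encode each vehicle with the same weights, then sum-pool.

    The sum over encodings is invariant to the order in which the
    vehicles are listed, so no training samples are spent on learning
    that all input permutations describe the same traffic situation.
    """
    encoded = np.tanh(vehicles @ W_enc)  # shape (N_VEHICLES, HIDDEN)
    pooled = encoded.sum(axis=0)         # order-independent pooling
    return float(pooled @ W_out)

vehicles = rng.normal(size=(N_VEHICLES, N_FEATURES))
shuffled = vehicles[rng.permutation(N_VEHICLES)]
# Reordering the vehicle list leaves the output unchanged.
assert np.isclose(q_value(vehicles), q_value(shuffled))
```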

    Tactical decision-making for autonomous driving: A reinforcement learning approach

    The tactical decision-making task of an autonomous vehicle is challenging, due to the diversity of the environments the vehicle operates in, the uncertainty in the sensor information, and the complex interaction with other road users. This thesis introduces and compares three general approaches, based on reinforcement learning, to creating a tactical decision-making agent. The first method uses a genetic algorithm to automatically generate a rule-based decision-making agent, whereas the second method is based on a Deep Q-Network agent. The third method combines the concepts of planning and learning, in the form of Monte Carlo tree search and deep reinforcement learning. The three approaches are applied to several highway driving cases in a simulated environment and outperform a commonly used baseline model by taking decisions that allow the vehicle to navigate 5% to 10% faster through dense traffic. However, the main advantage of the methods is their generality, which is indicated by applying them to conceptually different driving cases. Furthermore, this thesis introduces a novel way of applying a convolutional neural network architecture to a high-level state description of interchangeable objects, which speeds up the learning process and eliminates all collisions in the test cases.
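    One common way to combine planning and learning, as the third method does, is to let a learned policy prior guide the tree search. The sketch below uses the standard PUCT selection rule (AlphaGo Zero style) as a stand-in; the rule, the action names, and all numbers are illustrative assumptions, not the thesis's exact formulation.

```python
import math

def puct_score(q, visit_count, parent_visits, prior, c_puct=1.5):
    """Upper-confidence score balancing the learned value estimate (q),
    the learned prior probability of the action, and an exploration
    bonus that decays as the action is visited more often."""
    return q + c_puct * prior * math.sqrt(parent_visits) / (1 + visit_count)

# Three tactical actions at a tree node, with illustrative statistics:
# q = mean backed-up value, n = visit count, prior = learned policy output.
actions = {
    "stay":         dict(q=0.40, n=10, prior=0.5),
    "change_left":  dict(q=0.55, n=2,  prior=0.3),
    "change_right": dict(q=0.10, n=1,  prior=0.2),
}
parent_n = sum(a["n"] for a in actions.values())

# The search expands the action with the highest PUCT score: here the
# promising but little-visited lane change wins over the well-explored "stay".
best = max(actions, key=lambda k: puct_score(
    actions[k]["q"], actions[k]["n"], parent_n, actions[k]["prior"]))
```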

    Tactical Decision-Making in Autonomous Driving by Reinforcement Learning with Uncertainty Estimation

    Reinforcement learning (RL) can be used to create a tactical decision-making agent for autonomous driving. However, previous approaches only output decisions and do not provide information about the agent's confidence in the recommended actions. This paper investigates how a Bayesian RL technique, based on an ensemble of neural networks with additional randomized prior functions (RPF), can be used to estimate the uncertainty of decisions in autonomous driving. A method for classifying whether an action should be considered safe is also introduced. The performance of the ensemble RPF method is evaluated by training an agent on a highway driving scenario. It is shown that the trained agent can estimate the uncertainty of its decisions and indicate an unacceptable level when the agent faces a situation that is far from the training distribution. Furthermore, within the training distribution, the ensemble RPF agent outperforms a standard Deep Q-Network agent. In this study, the estimated uncertainty is used to choose safe actions in unknown situations. However, the uncertainty information could also be used to identify situations that should be added to the training process.
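    In the randomized-prior-function formulation, each ensemble member k outputs Q_k = f_k(s) + beta * p_k(s), where the prior network p_k is randomly initialised and never trained, and the spread of Q-values across members approximates epistemic uncertainty. The sketch below replaces the deep networks with linear stand-ins; all sizes and the value of beta are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
K, N_ACTIONS, N_FEATURES, BETA = 5, 3, 4, 3.0  # illustrative sizes

# One trainable map f_k and one fixed random prior p_k per ensemble member.
# A real agent would use deep networks; linear maps keep the sketch minimal.
trainable = [rng.normal(size=(N_FEATURES, N_ACTIONS)) for _ in range(K)]
priors    = [rng.normal(size=(N_FEATURES, N_ACTIONS)) for _ in range(K)]

def ensemble_q(state: np.ndarray) -> np.ndarray:
    """Return a (K, N_ACTIONS) matrix: member k's Q-values in row k,
    computed as Q_k = f_k(s) + BETA * p_k(s)."""
    return np.stack([state @ f + BETA * (state @ p)
                     for f, p in zip(trainable, priors)])

state = rng.normal(size=N_FEATURES)
q = ensemble_q(state)

# Disagreement between members approximates epistemic uncertainty per action;
# far from the training distribution this spread stays large.
epistemic = q.std(axis=0)
```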

    Ensemble Quantile Networks: Uncertainty-Aware Reinforcement Learning With Applications in Autonomous Driving

    Reinforcement learning (RL) can be used to create a decision-making agent for autonomous driving. However, previous approaches provide black-box solutions, which do not offer information on how confident the agent is about its decisions. An estimate of both the aleatoric and epistemic uncertainty of the agent's decisions is fundamental for real-world applications of autonomous driving. Therefore, this paper introduces the Ensemble Quantile Networks (EQN) method, which combines distributional RL with an ensemble approach, to obtain a complete uncertainty estimate. The distribution over returns is estimated by learning its quantile function implicitly, which gives the aleatoric uncertainty, whereas an ensemble of agents is trained on bootstrapped data to provide a Bayesian estimation of the epistemic uncertainty. A criterion for classifying which decisions have an unacceptable uncertainty is also introduced. The results show that the EQN method can balance risk and time efficiency in different occluded intersection scenarios, by considering the estimated aleatoric uncertainty. Furthermore, it is shown that the trained agent can use the epistemic uncertainty information to identify situations that the agent has not been trained for and thereby avoid making unfounded, potentially dangerous, decisions outside of the training distribution.
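    The split into the two uncertainty types can be sketched numerically: the spread of return quantiles within one member reflects aleatoric uncertainty (randomness in the outcome itself), while disagreement between ensemble members reflects epistemic uncertainty (lack of knowledge). The quantile values below are synthetic stand-ins for trained implicit quantile networks; the sizes and distribution parameters are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(2)
K, N_TAU = 5, 32  # ensemble members and quantile samples (illustrative)

# Stand-in for K trained quantile networks evaluated at one (state, action):
# row k holds member k's estimated return quantiles, sorted over tau.
quantiles = np.sort(rng.normal(loc=1.0, scale=0.5, size=(K, N_TAU)), axis=1)

member_means = quantiles.mean(axis=1)

# Aleatoric: spread of the return distribution itself,
# averaged over the ensemble members.
aleatoric = quantiles.var(axis=1).mean()

# Epistemic: disagreement between members about the expected return;
# this grows for inputs unlike anything in the (bootstrapped) training data.
epistemic = member_means.var()
```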

    An Evolutionary Approach to General-Purpose Automated Speed and Lane Change Behavior

    This paper introduces a method for automatically training a general-purpose driver model, applied to the case of a truck-trailer combination. A genetic algorithm is used to optimize a structure of rules and actions, and their parameters, to achieve the desired driving behavior. The training is carried out in a simulated environment, using a two-stage process. The method is then applied to a highway driving case, where it is shown that it generates a model that matches or surpasses the performance of a commonly used reference model. Furthermore, the generality of the model is demonstrated by applying it to an overtaking situation on a rural road with oncoming traffic.
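    The parameter-optimization part of such a genetic algorithm can be sketched in a few lines: a population of genomes (rule parameters) evolves by selection and mutation against a fitness function. The two parameters, the target values, and the fitness function below are toy assumptions, not the paper's actual rule structure or two-stage process.

```python
import numpy as np

rng = np.random.default_rng(3)

# Toy genome: two rule parameters (e.g. a desired time gap and an
# overtaking threshold).  Fitness rewards genomes near an assumed ideal;
# in the paper, fitness would come from simulated driving performance.
TARGET = np.array([1.5, 0.8])

def fitness(genome: np.ndarray) -> float:
    return -float(np.sum((genome - TARGET) ** 2))

pop = rng.uniform(0.0, 3.0, size=(20, 2))  # initial random population
for _ in range(100):
    scores = np.array([fitness(g) for g in pop])
    parents = pop[np.argsort(scores)[-10:]]            # truncation selection
    children = parents + rng.normal(scale=0.1, size=parents.shape)  # mutation
    pop = np.vstack([parents, children])               # elitist: keep parents

best = pop[np.argmax([fitness(g) for g in pop])]
# After a few generations the best genome approaches the target parameters.
```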

    Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections

    This paper investigates how a Bayesian reinforcement learning method can be used to create a tactical decision-making agent for autonomous driving in an intersection scenario, where the agent can estimate the confidence of its decisions. An ensemble of neural networks, with additional randomized prior functions (RPF), is trained by using a bootstrapped experience replay memory. The coefficient of variation in the estimated Q-values of the ensemble members is used to approximate the uncertainty, and a criterion that determines if the agent is sufficiently confident to make a particular decision is introduced. The performance of the ensemble RPF method is evaluated in an intersection scenario and compared to a standard Deep Q-Network method, which does not estimate the uncertainty. It is shown that the trained ensemble RPF agent can detect cases with high uncertainty, both in situations that are far from the training distribution, and in situations that seldom occur within the training distribution. This work demonstrates one possible application of such a confidence estimate, by using this information to choose safe actions in unknown situations, which removes all collisions from within the training distribution, and most collisions outside of the distribution.
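    The confidence criterion described above reduces to a simple check: compute the coefficient of variation of the ensemble's Q-values for an action and compare it to a threshold. The Q-values and the threshold below are illustrative assumptions; in practice the threshold is a tuning parameter.

```python
import numpy as np

# Hypothetical Q-values for one action from a 5-member ensemble.
q_members = np.array([0.52, 0.48, 0.55, 0.50, 0.47])

# Assumed confidence threshold on the coefficient of variation;
# actions exceeding it would fall back to a safe default policy.
CV_MAX = 0.2

# Coefficient of variation: relative disagreement between the members.
cv = q_members.std() / abs(q_members.mean())
action_is_confident = cv < CV_MAX
```

Far outside the training distribution the members, anchored by their different random priors, disagree strongly, so cv rises above the threshold and the action is rejected.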

    Design and Evaluation of a Customizable Multi-Domain Reference Architecture on top of Product Lines of Self-Driving Heavy Vehicles – An Industrial Case Study

    Self-driving vehicles for commercial use cases like logistics or open-cast mines increase their owners' economic competitiveness. Volvo maintains, evolves, and distributes a vehicle control product line for different brands like Volvo Trucks, Renault, and Mack in more than 190 markets worldwide. From the different application domains of their customers originates the need for a multi-domain reference architecture concerned with transport mission planning, execution, and tracking on top of the vehicle control product line. This industrial case study is the first of its kind reporting on the systematic process to design such a reference architecture involving all relevant external and internal stakeholders, development documents, low-level artifacts, and literature. Quantitative and qualitative metrics were applied to evaluate non-functional requirements on the reference architecture level before a concrete variant was evaluated using a Volvo FMX truck in an exemplary construction site setting.

    Select Bibliography of Contributions to Economic and Social History Appearing in Scandinavian Books, Periodicals and Year-books, 1986
